Accelerating Reinforcement Learning by Mirror Images
نویسندگان
چکیده
あらまし 本研究では,強化学習の代表的な手法の Q学習を使用して,追跡問題のための学習速度の向上手法を提案 する.本研究のアイデアは,鏡像による対称性を利用して,フィールドの Q値を学習することにある.このことで左 右の対象差のみを伴う学習をすることが可能である.また,Q値の同時更新による収束性についても論じる. In this investigation we propose how to accelerate Q-learning which is one of the most successful reinforcement learning methods using mirror images for hunting problems. Mirror images have symmetric differences on right and left views, they allow us to accelerate Q-learning dramatically. In addition, we show Q-values’ convergence while they are kept updated during the process. キーワード 強化学習,Q学習,同時更新,鏡像
منابع مشابه
Mirror Descent Search and Acceleration
In recent years, attention has been focused on the relationship between black box optimization and reinforcement learning. Black box optimization is a framework for the problem of finding the input that optimizes the output represented by an unknown function. Reinforcement learning, by contrast, is a framework for finding a policy to optimize the expected cumulative reward from trial and error....
متن کاملA Probabilistic Reinforcement-Based Approach to Conceptualization
Conceptualization strengthens intelligent systems in generalization skill, effective knowledge representation, real-time inference, and managing uncertain and indefinite situations in addition to facilitating knowledge communication for learning agents situated in real world. Concept learning introduces a way of abstraction by which the continuous state is formed as entities called concepts whi...
متن کاملAssociative Neural Models for Biomimetic Multi- Modal Learning in a Mirror Neuron-based Robot
By using neurocognitive evidence on mirror neuron system concepts the MirrorBot project has developed neural models for intelligent robot behaviour. These models employ diverse learning approaches such as reinforcement learning, self-organisation and associative learning to perform cognitive robotic operations such as language grounding in actions, object recognition, localisation and docking. ...
متن کاملMyocardial fibrosis delineation in late gadolinium enhancement images of Hypertrophic Cardiomyopathy patients using deep learning methods
Introduction: Accurate delineation of myocardial fibrosis in Late Gadolinium Enhancement on Cardiac Magnetic Resonance (LGE-CMR) has a crucial role in the assessment and risk stratification of HCM patients. As this is time-consuming and requires expertise, automation can be essential in accelerating this process. This study aims to use Unet-based deep learning methods to automate the mentioned ...
متن کاملAccelerating Action Dependent Hierarchical Reinforcement Learning Through Autonomous Subgoal Discovery
This paper presents a new method for the autonomous construction of hierarchical action and state representations in reinforcement learning, aimed at accelerating learning and extending the scope of such systems. In this approach, the agent uses information acquired while learning one task to discover subgoals for similar tasks by analyzing the learned policy using Monte Carlo sampling. The age...
متن کامل